BUPT at TREC 2006: Enterprise Track

نویسندگان

  • Zhao Ru
  • Quian Li
  • Weiran Xu
  • Jun Guo
چکیده

This year, the expert search task requires a list of support documents provided for each expert. The change implied that support documents for the potential experts should be found before getting the experts themselves, which is one of the natural ways for expert search. The two-stage ranking method we used last year was just following this way. We develop an expert experience model using window-based method this year, in which our efforts were focused on the combination of using local content for evidence and quoting entire document for support. We also tried to treat some important types of data particularly both in the corpus and in a document. Finally the headings in every page were given a high weight. Each email author was given an additional weight for the confidence of their relationship with the email content. All our experiments were based on the 4.2version of Lemur Toolkit, in which language model with Bayesian smoothing was used for relevance computing. For candidate location, the candidate list and the name disambiguation rules[1] used last year were still working this time. But we found there were some problems in encoding which would cause missing match for a few candidates. We accepted several encoding representation in our system. The detail of the expert experience model and some improvements are in the following analysis.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BUPT at TREC 2006: Spam Track

This report summarizes our participation in the TREC 2006 spam track, in which we consider the use of Bayesian models for the spam filtering task. Firstly, our anti-spam filter, Kidult, is briefly introduced. And then we try to use weighted adjustment of separating hyperplane and selective classifiers ensemble to improve the filtering performance. Finally, we summarize the relevant results from...

متن کامل

TREC 2005 Enterprise Track Experiments at BUPT

This paper introduces and analyzes some experiments to find valid methods and features in enterprise search. For this purpose, two main experiments have been done. One is to retrieve some emails which contain the required information in all the emails of an enterprise, and the other is to try to find some experts who are helpful in a particular fields. Some features of the intranet dataset, suc...

متن کامل

BUPT at TREC 2009: Entity Track

This report introduces the work of BUPT (PRIS) in Entity Track in TREC2009. The task and data are both new this year. In our work, an improved two-stage retrieval model is proposed according to the task. The first stage is document retrieval, in order to get the similarity of the query and documents. The second stage is to find the relationship between documents and entities. We also focus on e...

متن کامل

TREC 2006 at Maryland: Blog, Enterprise, Legal and QA Tracks

In TREC 2006, teams from the University of Maryland participated in the Blog track, the Expert Search task of the Enterprise track, the Complex Interactive Question Answering task of the Question Answering track, and the Legal track. This paper reports our results.

متن کامل

Beijing University of Posts and Telecommunications(BUPT) at TREC 2016: A Rating Model Based on Tags for ABSTRACT Contextual Suggestion

In this paper we focus on the effort of Beijing University of Posts and Telecommunications (BUPT) on the TREC 2016's Contextual Suggestion Track. The problem we are supposed to tackle is how to make suggestions for a particular person with the provided context as well as its preferences. Basically we regard tags as the most important factor, and get ratings for different attractions with the ra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006